Beyond the CPU: Hardware Performance Counter Monitoring on Blue Gene/Q

نویسندگان

  • Heike McCraw
  • Daniel Terpstra
  • Jack J. Dongarra
  • Kris Davis
  • Roy G. Musselman
چکیده

The Blue Gene/Q (BG/Q) system is the third generation in the IBM Blue Gene line of massively parallel, energy efficient supercomputers that increases not only in size but also in complexity compared to its Blue Gene predecessors. Consequently, gaining insight into the intricate ways in which software and hardware are interacting requires richer and more capable performance analysis methods in order to be able to improve efficiency and scalability of applications that utilize this advanced system. The BG/Q predecessor, Blue Gene/P, suffered from incompletely implemented hardware performance monitoring tools. To address these limitations, an industry/academic collaboration was established early in BG/Q’s development cycle to insure the delivery of effective performance tools at the machine’s introduction. An extensive effort has been made to extend the Performance API (PAPI) to support hardware performance monitoring for the BG/Q platform. This paper provides detailed information about five recently added PAPI components that allow hardware performance counter monitoring of the 5D-Torus network, the I/O system and the Compute Node Kernel in addition to the processing cores

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Counter Monitoring for the Blue Gene/Q Architecture

With the increasing scale and complexity of large computing systems the effort of performance optimization grows more and more and so does the responsibility of performance analysis tool developers. To be of value to the High Performance Computing (HPC) community, performance analysis tools have to be customized as quick as possible in order to support new processor generations as well as chang...

متن کامل

Performance Counter Monitoring for the Blue Gene / Q Architecture

This quarter’s newsletter features the performance area, which has a broad focus including autotuning, performance modeling, end-to-end performance measurement, and tool integration. We aim to extend and integrate mature, robust performance measurement technologies and develop a comprehensive performance data management framework that can be used by other areas of the SUPER project. End-to-end ...

متن کامل

Non-Determinism and Overcount on Modern Hardware Performance Counter Implementations – Extended

Ideal hardware performance counters provide exact deterministic results. Real-world performance monitoring unit (PMU) implementations do not always live up to this ideal. Events that should be exact and deterministic (such as retired instructions) show run-to-run variation and overcount on x86 64 machines, even when run in strictly controlled environments. These effects are non-intuitive to cas...

متن کامل

Blue Gene/L performance tools

performance tools X. Martorell N. Smeds R. Walkup J. R. Brunheroto G. Almási J. A. Gunnels L. DeRose J. Labarta F. Escalé J. Giménez H. Servat J. E. Moreira Good performance monitoring is the basis of modern performance analysis tools for application optimization. We are providing a variety of such performance analysis tools for the new Blue Genet/L supercomputer. Those tools can be divided int...

متن کامل

Parallel deep neural network training for LVCSR tasks using blue gene/Q

While Deep Neural Networks (DNNs) have achieved tremendous success for LVCSR tasks, training these networks is slow. To date, the most common approach to train DNNs is via stochastic gradient descent (SGD), serially on a single GPU machine. Serial training, coupled with the large number of training parameters and speech data set sizes, makes DNN training very slow for LVCSR tasks. While 2nd ord...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013